All tags

#Inference Optimization

10 articles

Tech 9 min

TRACER trains a surrogate from LLM classification API logs and swaps in via a parity gate